The Annals of Statistics 

2006, Vol. 34, No. 4, 1987-2014 

DOI: 10.1214/009053606000000434 

© Institute of Mathematical Statistics, 2006 

WEIGHTED APPROXIMATIONS OF TAIL COPULA PROCESSES 
WITH APPLICATION TO TESTING THE BIVARIATE 
EXTREME VALUE CONDITION 

By John H. J. Einmahl, Laurens de Haan and Deyuan Li 1 

Tilburg University, Erasmus University and University of Bern 

Consider n i.i.d. random vectors on M 2 , with unknown, common 
distribution function F. Under a sharpening of the extreme value con- 
dition on F, we derive a weighted approximation of the correspond- 
ing tail copula process. Then we construct a test to check whether 
the extreme value condition holds by comparing two estimators of 
the limiting extreme value distribution, one obtained from the tail 
copula process and the other obtained by first estimating the spec- 
tral measure which is then used as a building block for the limit- 
ing extreme value distribution. We derive the limiting distribution of 
the test statistic from the aforementioned weighted approximation. 
This limiting distribution contains unknown functional parameters. 
Therefore, we show that a version with estimated parameters con- 
verges weakly to the true limiting distribution. Based on this result, 
the finite sample properties of our testing procedure are investigated 
through a simulation study. A real data application is also presented. 



1. Introduction. 

1.1. The extreme value model and its use. Let (X, Y), (X\, Y\), . . . , 
(X n ,Y n ) be i.i.d. random vectors with continuous distribution function (d.f.) 
F. Suppose that there exist norming constants a n , c n > and b n , d n 6 IR such 
that the sequence of d.f.'s 

/ maxi<i< n Xi - b n maxi< i<n Y { - d n \ 
P[ ^ <x, ^ <yj 
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converges to a limit d.f., say, G(x,y), with nondegenerate marginal d.f., that 
is, 

(1.1) lim F n (a n x + b n ,c n y + d n ) = G(x,y) 

n — >oo 

for all but countably many x and y. Then, for a suitable choice of a n ,b n ,c n 
and d n , there exist 71,72 £ M such that 

G(x, 00) = exp(-(l + 7i^)" 1/71 ), G(oo, y) = exp(-(l + 722/)" 1/72 )• 

The d.f. G is called an extreme value d.f. and 71,72 are called the (marginal) 
extreme value indices. 

Any extreme value d.f. G can be represented as 



G 

(1.2) 



x~^ - 1 y-T 2 - 1 
7i ' 72 

= exp - / (s(lAtan0))V(j/(lAcot0))*(d0) 







with $ the d.f. of the so-called spectral measure. There is a one-to-one 
correspondence between extreme value d.f.'s G and finite measures with d.f. 
$ that satisfy 

/•tt/2 /-tt/2 

/ (1 Atan0)$(d0) = / (1 Acot0)$(d0) = l, 
JO Jo 

via (1.2). 

Alternatively, one can characterize the extreme value d.f.'s G by the fol- 
lowing: there is a measure A on [0, oo] 2 \ {(00,00)} such that, with 

/ T -7i _ 1 ?/ -72 _ 1 \ 

(1.3) l(x,y):=-logG( V - , 

V 71 72 / 

we have 

1. l(x, y) = M{(u, v) € [0, oo] 2 : u < x or v < y}), 

(1.4) 



2. I (ax, ay) = al{x,y) fora,x,y>0. 

More generally, we have for any a > and any Borel set A C [0,oo] 2 \ 
{(00,00)} 

(1.5) A(aA) = aA(A), 
with aA := {(ax, ay) : (x, y) £ A}. Also, (1.1) implies 

(1.6) y ^K 1 " Fl ( x )' 1 " F2 ( y )) G ^) = 

for any Borel set A, provided A(dA) = 0, where F\(x) := F(x,co) and 
^2(2/) -=F(oo,y). See, for example, [2]. 
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The bivariate extreme value framework is the appropriate one when one 
wants to estimate the probability of an extreme set, that is, a set out- 
side the range of even the largest observations. Take a > small. Since 
by (1.5) and (1.6), for small t, 

P((l - F 1 (X), 1 - F 2 (Y)) e taA) « oP((l - Pi (A), 1 - F 2 (Y)) G tA), 

we can estimate the probability of tA — outside the range of the observations — 
asymptotically by estimating the probability of the pulled back set taA using 
the empirical measure. See [3]. Condition (1.1) is fulfilled for many standard 
distributions but not for all distributions. Hence, before using this framework 
to estimate probabilities of extreme sets, it is important to check whether 
(1.1) is a reasonable assumption for the data set at hand. And one wants 
to do this beforehand, without specifying the exact structure of the limiting 
distribution. 

1.2. Estimation of model parameters. Now, in order to develop a test, 
let us consider the following. Relation (1.1) implies [cf. (1.6)] 

Kmt- l P((l - Pi (A)) A (1 - F 2 (Y)) < t, 

(1.7) 

1-F 2 (Y) < (l-Pi(A))tan6») = $(6>) 
for continuity points 6 S (0, 7r/2] of Also, for (x,y) £ [0,oo) 2 , 
(1.8) limi _1 P(l - Pi (A) <tx or 1 - F 2 (Y) < ty) = l(x,y). 

A nonparametric estimator for <3?, suggested by the limit relation (1.7) is 



1 n 



[8] 

(1.9) $(0) := T^I{RfyRY> n +l-k, n+l-Rf <(n+l-R? )tan0}' 

i=l 

where Rf is the rank of Aj among Ai , X 2 , . . . , X n and Rj is the rank of Y{ 
among Y\,Y 2 , . . . ,Y n , where k = k(n) is an intermediate sequence of integers, 
that is, k — > oo, k/n — > 0, as n^oo. Similarly, a nonparametric estimator 
for /, suggested by the limit relation (1.8), is ([9]; see also [6]) 

1 n 

h(x,y) ■= -^22^{Xi>X n+1 _ [kxl . n or Yi>Y n+1 _ [ky] . n } 

(1.10) 

1 - 

= T^2^{Rf >n+l-kx or RY >n+l~ky} ' 
i=l 

where X\ :n < • • • < X n - n are the order statistics of the Aj, i = 1,2, . . . ,n 
(similarly for the Yi), with \z] the smallest integer > z. 
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Another way of estimating I is via (1.2), (1.3) and (1.9): 



(1.11) h(x,y):= (x(lAtan0))V(y(lAcot0))$(d0). 



/■ tt/2 



JO 



1.3. The test. A promising approach to testing whether the null hypoth- 
esis (1.8) holds seems to be to see if the two estimators l\ and I2 for I, that 
have a different background, are not too different. The estimator I2 is a 
natural one mimicking more or less the tail of the distribution itself. But 
this estimator does not necessarily satisfy condition 2 of (1.4). On the other 
hand, l\ does satisfy condition 2 of (1.4), but the estimator itself is of a 
somewhat more complicated nature. 

The proposed test statistic is of the Anderson-Darling type: 



for certain f3 > 0. The test statistic is similar to those used for testing a 
parametric null hypothesis (like testing for normality), where the empirical 
distribution function is compared with the true distribution function with 
estimated parameters. Here, however, the estimated parameter $ is a func- 
tion (and we only deal with the tail of the distribution). Also, note that our 
methods allow us to deal with test statistics other than L n as well. 

It is not difficult to see that if relation (1.8) is true, the statistic L n tends to 
zero in probability as n — ► 00. We shall establish the asymptotic distribution 
of kL n as n — > 00 under (1.1) and some extra conditions stemming from [9] 
and [8], thus providing a basis for applying a test. The hypothesis to be 
tested is (1.8). For the asymptotic normality of the test statistic kL n , under 
Hq, extra conditions are needed. See Remark 2.2 below. 

Note that this test checks whether the dependence structure satisfies (1.8) 
and not if the marginals F\,F2 are of the right type. It is only based on 
the relative positions (ranks) of the data and completely independent of 
the marginal distributions for which tests have been developed already in 



1.4. Use of test. As mentioned before, the test statistic kL n can be used 
for a preliminary test of the extreme value model (1.1) before one uses the 
model in applications. Note that the test statistic kL n is based on observa- 
tions for which at least one component exceeds a certain threshold. Since 
the estimators depend on this threshold, one can plot kL n as a function of 
k. This plot can be used as an exploratory tool for determining from which 
threshold on the two estimators l\ and I2 are close to each other, suggesting 
that the approximations (1.7) and (1.8) can be trusted, and, hence, yields 
a heuristic procedure for determining k. So this is a second use of the test 
statistic kL n . See also [14], Section 5.4, and [1]. 



(1.12) 




[5] and [4]. 
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1.5. Outline of the paper. The weak convergence of kL n is stated in 
Theorem 2.3. For the proof of this theorem, the known asymptotic normality 
result for $ [8] is sufficient but not the known one for l 2 [9]. Hence, as a 
preliminary but important result, we first develop a Gaussian approximation 
for the weighted tail copula process on (0, l] 2 , 

Vk{l 2 {x,y) - l(x,y))/(x Vy)", < v < 1/2, 

thus extending significantly the result of [9], where ij = 0. This result is 
stated in Theorem 2.2. The proofs are given in Section 3. 

The limiting random variable in Theorem 2.3 is determined as an integral 
of a combination of Gaussian processes. They are parametrized by func- 
tions which can be estimated consistently. In Section 4 it is proved that the 
probability distribution of the limiting random variable with these functions 
estimated converges to the distribution of the limiting random variable with 
these functions equal to the actual ones, which makes the procedure appli- 
cable in practice. In Section 5 simulation results and an application to real 
data are reported. 

2. Main results. Before stating the main results, we need to introduce 
some notation. 

Let W\ be a Wiener process indexed by the Borel sets in [0, oo] 2 \{(oo, 00)}, 
depending on the parameter A from (1.4) in the following way: W\ is a 
Gaussian process and for Borel sets C and C, 

EW A {C) = and EW A (C)W A (C) = A(C n C). 

Define the sets Cg by 

C e = {(x,y) G [0, oo] 2 :x Ay < l,y < xtan0}, 6 G [0,vr/2]. 

Assume that the measure A has a density A. The process Z on [0,7r/2] is 
defined by 

Z(6)= / X(x,xtan6)(W 1 (x)tane -W 2 (xtan6))dx 

Jo 



poo 

W 2 (l) / X(x,l)dx 

JlWl/ttand) 



(2.1) 



'lVl/(tan6») 

^/4,V2](W(l) / Khy)dy, 0e[O,7r/2), 



'( I ) -w 2 



/oo /"CO 
\(x,l)dx-Wi(l) J X(l,y)dy, 



.2, 

where Wi,W 2 are the marginal processes defined by 

Wi(aO = WA(Mx[0,oo]) and W 2 (y) = Wa([0,oo] x [0,y]). 



6 J. H. J. EINMAHL, L. DE HAAN AND D. LI 

Clearly, both processes are standard Wiener processes. Define, for x,y > 0, 

(2.2) R(x,y) = A([0,x] x [0,y]), 

(2.3) W R (x,y) = W A ([0,x]x[0,y}), 

(2.4) R 1 (x,y) = dR(x,y)/dx, R 2 (x,y) = dR(x,y)/dy. 

For convenient presentation and convenient application, the next two the- 
orems are presented in an approximation setting (with all the processes in- 
volved defined on one probability space), via the Skorohod construction. So 
in these theorems, l\, l 2 and the limiting processes A and B (defined below) 
are only equal in distribution to the original ones, but we do not add the 
usual tildes to the notation. 



Theorem 2.1. Assume that condition (1.6) and Conditions 1 and 2 of 
[8] hold, and that A has a continuous density A on [0, 00) 2 \ {(0, 0)}. Then 

\Vk(h(x,y) - l(x,y)) - A(x,y)\ P 

Slip ^ U 

o<x, y <i xVy 
as n — > 00 , where 

' 7T 



A(x,y) :-- 



x[W A (C n/2 ) + Z 



+ y 



arctan y/x ^ 



ir/4 



x[W A (C„ /2 ) + Z 



sin 

7T 



:(W A (C e ) + Z(0))de, if y>x, 



tt/4 
arctan y/x 



1 



cos 2 9 



(W A (C e ) + Z(9))d9, if y<x. 



Write Ui = l- F 1 (X i ),V i = 1 - F 2 (Y i ), % = 1,2, . . . ,n. Let C(x,y) be the 
distribution function of (Ui,Vi). By (1.6) and (2.2), we have 

R(x,y) =limt- 1 C(tx,ty). 

We assume, as in [9], that, for some a > 0, 

(2.5) r l C{tx, ty) - R(x, y) = 0{t a ) ast|0, 

uniformly for x V y < 1, x, y > 0. 

Theorem 2.2. Assume that conditions (1.6) and (2.5) hold and that 
k = o(n 2a /( 1+2a )) . If R\ and R 2 are continuous, then we have, for < r/ < 
1/2, 
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as n — ► oo ; where B(x,y) := Wr(x, y) — R\{x,y)W\{x) — R.2(x, y)W2(y)- 

Theorem 2.3. Assume the conditions of Theorems 2.1 and 2.2 hold. 
Then for each 0<f3<3, 



as n — > oo, and the limit is finite almost surely. 

Remark 2.1. The case j3 = is similar to the Cramer-von Mises test. 
Note that for (5 < 2, Theorem 2.3 easily follows from an unweighted approx- 
imation in Theorems 2.1 and 2.2. Therefore, the case (5 = 2 (and not, as 
usual, (3 = 1) is similar to the Anderson-Darling test. 

Remark 2.2. Conditions 1 and 2 of [8] are rather involved. They require 
that the convergence in (1.6) hold uniformly over certain classes of sets. 
Moreover, they put an extra restriction on the growth of the sequence k(n), 
related to the rate of that convergence. The assumption that A has a density 
A excludes, for example, asymptotic independence, that is, l(x,y) = x + y, 
for all x,y > 0. Condition (2.5) is rather weak, but there are distributions 
for which (1.1) holds with asymptotic dependence, but where the rate of 
convergence is slower than t a for any a > 0. 

Remark 2.3. The random variable on the right in Theorem 2.3 has a 
continuous distribution function. This follows from a property of Gaussian 
measures on Banach spaces: the measure of a closed ball is a continuous 
function of its radius; see, for example, [13], Chapter 4, Theorem 1.2. 

Remark 2.4. Since x V y < l(x, y) <x + y < 2(x V y), (2.6) remains true 
with x V y replaced with l(x,y) or x + y, but, when choosing l(x,y), the 
left-hand side of (2.6) is not a statistic and I has to be estimated. 

3. Proofs. Before proving Theorem 2.1, we first present two lemmas and 
a proposition. 

Lemma 3.1. 



(2.6) 





(A{x,y)+B(x,y)) 2 
(xVy)P 



dx dy 




if y>x 



if y<x. 
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PROOF. Since 

f n/2 

l(x,y)= / (x(lAtan0))V(y(lAcot0))$(d0) 
Jo 

7r/4 /' 7r /2 

(xtanO) \Jy<S>(d6) + / x V (ycot 6)®{dB) 

Jtt/4 

and x tan 9 > y x > y cot 9 44> > arctan | , we have 

n7r/4Aarctany/a: rn/4 



rir 1 4/\arctany / x 

l(x,y)= / y${d9) + I ' xtan0$(d0) 

JO J it / 4Aaxcta,ny / x 

/•7r/4Varctan j//:r /" 7r /2 

+ / ' ycot0<l>((i0) + / ' x$(d9) 

Jn/4: J tt j '4Varctany /x 

rn /4 />arctan y/x r n /2 

/ y<5>{d9) + I ycot9<$>{d9) + I ' x<S>(d9), 

JO Jit/4 ./ arctan ty/x 



7r/4 Jarctanj//x 

if y > x, 

/ y${d9) + / ' xtan0<I>(cZ0) + / x$(d0), 

jO Jarctany/x Jtt/A 

if y < x. 

In the case y > x, via integration by parts, one has 
l(x, y) = V^^J^J ~ 2/^(0) + y cot ^arctan — ^<I> ^arctan — J — ycot f 

/■arctan^/z / ]_ \ /^x / y N 

- s L •<»> J * + *•( a ) - ^ ( mtim i, 

/-arctan y/x 1 

2/ Jtt/4 sm z 

In the case y < x, via integration by parts again, one has 

( U\ 7T / 7T 

l(x, y) = y<I> arctan — — y$(0) + xtan — <I> — 
V x J 4 V 4 

— x tan ( arctan — J $ ( arctan — 
\ x J V ^ 

-x [ n/i ^0)—L- ( l9 + x$[-)-x<f> l 

Jaictany/x COS* 1 9 \2 J \4 

= x*(-]-x/ — T -<S>(6)d9. 

\2J J arctan y/i COS^ U 



7T \ T / 7T 



Write 



(3.1) R n (x,y) = -Ci—,—J, T n (x,y) = -J2 I {U i <kx/n,V i <ky/n}, 

i=l 



TESTING THE BIVARIATE EVT CONDITION 

(3.2) v n (x,y) = Vk(T n (x,y) - R n (x,y)), v n)V (x,y) = ? n ^'^ 

[x V yyi 

and 

_ v n (x,oo) 

Vn,T),l{%) — „ , 

(3.3) 

v n {oo,y) . 

Vn,ri,2[y) = -JJ 1 V n j = V nfi j,J = 1,2. 

Proposition 3.1. Lei T > 0. For < ?? < 1/2, 

( v n,ri( x ,y),x, y G (0, T], f njJ)i i(x), x £ (0,T],t>„ )J7)2 (y),y G (0,T]) 
converges in distribution to 

f W R (x,y) W 1 (x) W^y) 

as n — > oo . 

Proof. Define 

2n,j = —r=$((n/k)Ui,(n/k)Vi), 

and for all < x, y < T, define the functions 

= l[0,x)x[0,y)/ ( x V 
fx = I[0,x)x[0,oo]/x T1 , fy = I[0,oo]x[0,y)/y V ■ 

All these /'s form the class T . We equip T with the semi-metric d defined 
by 



Mf f ^_ Lf W R (x,y) W R (u,v) \ 2 

and so on. 

For any e > 0, the bracketing number iVn(e, J-") is the minimal number of 

sets N £ in a partition J 7 = Uj=i^ej of the index set into sets T e j such that, 
for every partitioning set T e 



(3.4) ]T£* sup \Z n ^f)-Z n ^g)\ 2 <e< 

1=1 f,g&Fej 
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where, as usual, Z n ^(f) = J f dZ n ^ and E* means taking the outer integral 
when computing the expectation. 

We will use Theorem 2.11.9 in [17]: For each n, let Z n ^,Z n ^-, ■ ■ ■ , Z n ^ n be 
independent stochastic processes with finite second moments indexed by a 
totally bounded semimetric space (J r ,d). Suppose 

n 

^ J B*||Z ni i||^l { || Zn i ||^ >A} ^0 for every A > 0, 

i=l 

where = sup^ 6:F \Z n>i (f)\, and 

J iJlogN^e,?) de^O for every 5 n j 0. 

Then the sequence J2i=i{Zn,i ~ EZ n: i) is asymptotically tight in l 00 ^) 
and converges weakly, provided the finite-dimensional distributions converge 
weakly. 

We briefly sketch the total boundedness of (J-,d). We only consider the 
subclass J~2 of J- consisting of the bivariate f X y's; moreover, we restrict 
ourselves to the case x>y, u>v and x>u, y > v . For any 5 > 0, assuming 
\x — u\ <5 and \y — v\ <5, one has 

■a,. f ,_^f W R {x,y) W R {u,v) s 

a \Jx,yiJu,v) — i \ , 



V (iVj/)" (uVd)i 
u r >W R {x,y)-xVW R (u,v)^ 2 



u 2r 'R(x, y) - 2x r 'u r !R(u, v) + x 2r iR(u, v) 



(xu) 



2'/ 



If u < 5, then 



„ R{x,y) , 2R(u,v) i 



^jj u 2r? M 2n 

< x 1 " 257 + Zu l - 2r > < (25) 1 ~ 2ri + 35 1 - 211 < Stf 1 " 2 ''. 

If ii > 5, then, since 

< R(u,v)+A([u, x] x [0, oo]) + A([0, oo] x [v, j/]) < i?(u, u) + 25, 

we have 

2 flfr.ttX^-s'Q 2 2^ 

" \Jx,yi Ju,vj _ / n 9 _ "T 



(xu) 2r] (xu) 2r > 
^u 1 "^^ -x v ) 2 + 25^ 2v 
<u l ^x 2 ^ 2 {x-u) 2 + 25 1 ~ 2r ' 
< u-^ix - uf + 25 1 ~ 2r ' < S^ 1 " 2 ''. 
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So, since 1 — 1r\ > 0, we see that, for every e > 0, we can find a 5 > such 
that, for \x — u\ < 6 and \y — v\ < 5, d 2 (f x ,y,fu,v) < £• Hence, since [0,T] 2 is 
totally bounded with respect to the Euclidean metric, we obtain the total 
boundedness of (F, d) . 
Observe that 

Z n ,i(fx,y) = ^7=I{Ui<(k/n)x,Vi<{k/n)y}/( x V vT \ 

n 
i=l 

and similarly for the marginal processes. First we have to show that, for 
every A > 0, 

n 

(3-5) Y. E W Z nA\rI { \\ Zn A\r>X}^ Q 

i=l 

as n — > oo. Again, we will restrict ourselves to the subclass J~2- For the 
univariate /x^'s and fy s, it can be shown in a similar but easier way. 
Note that 

^U^/n^Wn^ V yf < ^p^, 
so for each A > 0, 



Y,E\\Z n ,i\\r 2 L 



< 



n,i||^ 2 J {||^„,i|b 2 >A} 
1 



Vfc jB ((nA)(?7 1 V ^))r, J {(n/fe)(f/iVV 1 )<(VfeA)-V^ 



x 1 aL\ —x, —x 



n 

+ rj Li —x, —x )x 1 ax 

Jo \n n J 

<^(^k X -(VkXy 1/ri + r ] [ iV ~ kX) ^-x'^dx) 
\/k\ n J n J 

= A i-i/ U i-i/(2„) + ^^-(v^A) 1 - 1 ^ 

1 — Tj 

= J—\^ k l~l/(^)^ ( ?? <l/2). 
1 — 1] 
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Next, we want to prove 

(3.6) J 5 J JlogN^Fjde-tO 

for every S n I 0. For notational convenience, we choose T = 1; for general 
T > 0, the proof goes the same. Let e > be small, define a = £ 3 /( 1-2 *?) and 
9 = 1 — e 3 . We again consider only J~2\ the univariate /'s are easier to handle. 
Define 

Tia) = {f x ,y £ J~2-x Ay < a}, 
T(l, m ) = {f x , y G T 2 : # m < x < 9 l , 6 m+1 <y< 6 m }. 

Then 

/ [log a I log 6] [log a/ log 8\ \ 
\ m=0 1=0 ) 

First check (3.4) for J- (a): 

n 

sup (Zn ti (f) - Z n! i(g)) 2 
i=l f,geF(a) 

= nE sup (Z ni i(/) - Z n .i(/j)) 2 
<4n£ sup Z^f) 

/&F(a) 

= — £ sup I^Kkx/ny^ky/n}/^ V y) 2v 

xAi/<a 

4n „/n \~ 2 \ 

"T U / / {( n / fc ) £/ 'i< > 

4n f ak l n fn \~ 2r? , 
-x ax 



2 



k Jo \k 
4 



^ < e 



1-2?/ 

Now we consider (3.4) for the ^"(^m); w.l.o.g. we take / < m: 

n 

^£ sup (Z n>i (f) - Z riii (g)) 2 
i=l f,9&F(!,m) 

<nE[ sup Z nl (f)- inf Z n \{f) 
\fer(i,m) fer(i,m) 
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- T E ( I {Ui<(k/n)8 l ,V 1 <(k/n)8 m }/(® l+1 V m+1 ) n 



I{u 1 <(k/n)e l + 1 ,v 1 <(k/n)e m + 1 } I v m ) v ) 2 



n 



fr E [ Vi<(i/«)» , ,VK(fc/n)e"'} Ur,(i+1) ~ QVI 



i y 

+ (^{U 1 <(k/n)e l ,V 1 <(k/n)9 m } ~ ^{(7i<(fc/n)0 i + 1 ,Vi<(fc/n)6»'™+ 1 })^7 J 



n n 



y A 2 "' 



< 



+ 

2n/k 9 l 



CI 



n n 



1 



2 2A: 0< 
1 + 



# 2 ^ 



(1-0)) 



< 2 



1 



0V2 



1 + 4(1 - 0) < e b + 4e 6 < e A 



It is easy to see that the number of elements of the "partition" of Ti is 
bounded by e~ 7 , which yields (3.6). Hence, we have proved the asymptotic 
tightness condition. 

It remains to prove that the finite-dimensional distributions of our process 
converge weakly. This follows from the fact that multivariate weak conver- 
gence follows from weak convergence of linear combinations of the compo- 
nents and the univariate Lindeberg-Feller central limit theorem. It is easily 
seen that the Lindeberg condition is satisfied for these linear combinations 
since the elements of T are weighted indicators and, hence, bounded. □ 



Lemma 3.2. For < ?? < 1/2 
\W R (x,y)\ 



P[ sup 

\ x Vy<e {xVyYl 
x,y>0 



> A ) < 16 ex P 

m=0 



A 2 2 m ( 1 - 2r i) 

2 l+2r, £ 1-2 V 



Proof. For m = 0, 1, 2, . . . , define 

Am = \ (x, y) : — %rr < x < , rr < V < £ )•■ 

m i v 2m+l — — 2 m 2 m + 1 ~~ ~~ ' 

Then, with Z a standard normal random variable, 



P( sup Eff 

0<x<y 



> A] =P[ sup 

0<a;<y 



> A 
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\W R {x lV )\ 

< P sup sup = > A 



ie{0,l,2,...} (x,y)eA m V 



<]TP SU P \W R (x,y)\>\ 

m=0 ^(x,y)eA m 



£ 



m=0 
oo 



Wr(—,s 



> A 



n 



2»Tt+l 



<4 p [\ zl ; 

m=0 

oo 

<8]Texp(-^ 

m=0 



A /2 m \ 1 / 2 ~ r i 

A 2 2 m ( 1 -2r?) 



2 l+2r, £ l-2r/ 



where the third inequality follows, for instance, from an adaptation of Lemma 1.2 
in [12] and the last inequality from Mill's ratio. A symmetry argument com- 
pletes the proof. □ 

By Theorem 2 in [8] and Proposition 3.1 (and their proofs), it follows that 
(Vk($(9) - $(9)) 

v) («)) 



W A (Cg) + Z(6) 



W R (x,y) W^u) W 2 (v) 
(xVy)i ' u* ' vv 



on I?[0,7r/2] x D[0,T] 2 x D[0,T] x D[0,T]. By the Skorohod construction, 
there exists now a probability space carrying v*, t>* l5 2 , W^(C.), 
W R , W{ and W 2 * such that 

(W* A (C), z*,w* R , wz,wt) = (W A (C), Z, W R , W!,W 2 ) 
and for < rj < 1/2, 

(3.7) D n := sup |\/fe(6*(0)-$(0))-TO(C e ) + Z*(e))|=op(l), 

O<0<7r/2 



(3.8) 



(3.9) 



sup ^- W ^ =o P (l), 

\v* n ^x)-Wt(x)\ 
sup — ■ =Op(l), 

0<x<T & 

K, 2 (y)-w?(y)l m 

sup ! = o P (l), 

o<j/<t y 7 ' 
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as n — > oo. Henceforth, we will work on this probability space, but drop the 
* from the notation. 

Proof of Theorem 2.1. By Lemma 3.1, 
Vk(h(x,y) - l{x,y)) 

xVk(& 



\ Ml 



x Vk(U\ )-* (I 



arctany/x 1 

+ y / —^Vk{$>{6)-${e))de, 



tt/4 sm 

vr/4 



arctany/a; COS 



if y > x, 
if y < x. 



First consider the case y > x: 

Vk(h(x,y) - l(x,y)) - A(x,y) 



sup 

0<x<y<l 



x V y 



1 



sup 



o<x< y <i xVy 



raxctaay / x i 

-(WA(C fl ) + Z(0)))dfl 



xD n yD n [*/ 2 1 ,„ 
< sup h sup / — 5— do — ► 0, 

o<x<y<i^Vy o<a:<2/<i » V y 7^/4 shr 6> 

in probability as n-> oo. For the case y < x, the proof is similar. □ 

Let Qi n and Qin be the empirical quantile functions of the {C/j}f =1 and 
{Vi}" =1 , respectively. Define 



1 n 

R(x,y) = t2J I {Ui<Qi n {kx/n),V i <Q2n 



(ky/n)}- 



i=l 



Note that, by (1.10), 

1 - 

l 2 (x,y) = -^L 



T 1 {U i <Qi„(kx/n) or V 1 <Q 2n (ky/n)}- 
i=l 



Proof of Theorem 2.2. It is easily seen that h(x,y) + R(x,y) = 
(\kx~\ + \ky\ — 2)/k < ([kx] + [ky])/k, for each x,y £ (0,1], almost surely. 
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So we have 

I Vk(l 2 (x, y) - l(x, y)) + Vk(R(x, y) - R(x, y))\ 
sup 

o<x, y <i (xVyp 

xVy>l/k 

\^k({l/k)(\kx] + \ky]-2)-(x + y))\ 

= SUp -, r 

o<x, y <i (xVyp 

xVy>l/k 

<k v sup Vk(x + y — ([kx] + [ky])/k) 

0<x,y<l 

<2^k-k r i- 1 = 2k r '- 1 / 2 -^0. 
Write Sj n {x) = ^Qjn(^x), j = 1,2. Then we have 

\Vk(l 2 (x, y) - l(x, y)) + W R (x, y) - R 1 (x, y)W 1 (x) - R 2 (x, y)W 2 {y)\ 

SUp -. r 

o<x, y <i (xVyp 

xVy>l/k 

=' sup \Vk(R(x,y) - R{x,y)) -W R (x,y) 

0<x,y<l 
xVy>l/k 

+ R 1 (x,y)W 1 (x) + R 2 (x,y)W 2 (y)\(x V y)^ + o(l) 
\Vk(R(x,y) - R n (S ln (x),S 2n (y))) - W R (x,y)\ 

= SUp -, r 

o<x, y <i (xVyjV 

xVy>l/k 

\Vk(Rn(Sin(x),S2n(y))) - R(S ln (x) , S 2n (y)) I 

~r sup 

o<x, y <i (x\/y) T i 

xVy>l/k 

+ sup \Vk(R(S ln (x),S 2n (y)) - R(x,y)) 

0<x,y<l 
xVy>l/k 

+ R 1 (x,y)W 1 (x,y) + R 2 (x,y)W 2 (y)\(x V y)"" + o(l) 
= :D 1 + D 2 + D 3 + o(l). 

We will show that Dj — > in probability, j = 1, 2, 3. We have 

n _ _ \\fk{T n (Si n (x),S 2n {y)) - R n {S\n{x),S 2n (y))) - W R (x,y)\ 

J~y 1 — Slip , , 

o<x, y <i [xVyp 

xVy>l/k 

< sup \Vk(T n (S ln (x),S 2n (y)) 

0<x,y<l 
xVy>l/k 

- R n (S ln (x),S 2n (y))) - W R (S ln (x), S 2n (y))\ 
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WQ ( \w Q I W-V ( Sln{x)V S 2n {y) \ 

x (Sin (a) V S 2n (y)) '• 

V a; Vy / 



Sm(x)VS 2n (y)y 
x V y 

|^fl(Si„(x),S2n(y))-Wk(a,i 



+ sup 

0<z,y<l [XVyP 
xVy>l/k 

\v n (s,t)-W R (s,t)\ (Qln{s)VQ2n{t)y 

< SUp - • SUP 

0<s,t<2 (sVtp 0<s,t<k/n\ SVt / 

sVt>l/is 

\W R (S ln (x),S 2n (y))-W R (x,y)\ 

+ SUp -, r 

o<x >y <i [x\Zy)^ 

x\Jy>l/k 

= :D n - D 12 + D 13 , 

where the last inequality holds with arbitrarily high probability. Then D\\ — > 
in probability because of (3.8) with T = 2. It is well known that 

(3.10) sup 9uM = 0p (l), j = l,2 

s>l/n s 

(see [16], page 419). Hence, D\\ ■ D\ 2 — > 0, in probability. Now consider, for 
each e > 1/k, 

n , \W R (S ln (x),S 2n (y))-W R (x,y)\ 

-^13 < SUp 

0<a;,a<i £ ' 

xVy>£ 

+ sup \W R (S ln (x),S 2n (y))\ / Qm(s)VQ2n(*) y 
o<x, H <i (Si n (x) V S2n(y)) r > s,t>l/n\ / 

l/k<iVj<E — 

\W R (x,y)\ 

+ SUP — r 

o<x, y <i (xWyP 

l/k<xVy<e 

= :D U + D 15 + D 16 . 
By the (uniform) continuity of W R and the fact that 

71 

(3.11) sup -r\Qjn{t) -t\ ->0, a.s.,j = l,2, 

0<t<k/n fc 

-D14 — ► in probability a.s. for any e > 0. Let 5 > 0; by (3.10) and Lemma 3.2, 
we see that, for large n, P(D±s > 5) < 5 for e > small enough. Again, from 
Lemma 3.2, we have P(D\q > 5) < 5. Hence, D13 — > in probability and, 
consequently T)\ — > 0, in probability. 

Consider L> 2 - Take (a, 6) with a\lb = u. Then according to (2.5), 



-C(ta,tb) = —c(tu-,tu 
t ut \ u u 
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= ur( -,-) +u 1+a O(t a ) = R(a,b) + (aVb) 1+a O(t a ) 
\u u J 

Now with arbitrarily high probability, 

\Vk(Rn(x,y) - R(x,y))\ (Qm{s) V Q 2n (t) y 

L>2 < SUp r • SUp 



0<x,j/<2 (xVy) 7 ! sVt>l/n\ s Vt 

We have seen before that the second term of this product is Op(l). So it 
suffices to show that the first term is o(l): 

\Vk(R n (x,y) - R(x,y))\ ( Vk(x V y) l+a \ (( k 

SUp -. 7 = SUp -. r )(J — 

0<x,y<2 (lVl/)1 \0<x,y<2 [X V y)^ J \\n 

fk a+l l 2 \ 



by assumption. Hence, D2 — > in probability. 

It remains to show that -D3 — > in probability. By two applications of the 
mean-value theorem, we obtain 

R(Si n (x), S 2n {y)) - R(x, y) 

= R(Si n (x), S 2n (y)) - R(x, S 2n (y)) + R(x, S 2n {y)) - R(x, y) 
= Ri(0i n , S 2n (y))(Si n (x) — x) + R 2 (x,9 2n )(S 2n (y) - y), 

with 9\ n between x and S\ n {x) and 9 2n between y and S 2n (y). So 

n . iR^B^S^iy^VkjSmix) - x) + R^x^Wtjx)] 

o<x, y <i [xVyp 

xVy>l/k 

\R2(x, d 2n )Vk(S 2n (y) -y) + R 2 (x,y)W 2 (y)\ 

"T~ Slip . r 

o<x, y <i (xVyp 

xVy>l/k 

We consider only the first term on the right-hand side of this expression; the 
second one can be dealt with similarly. Write z n (x) = ^fk(S\ n {x) — x). Prom 
(3.9) with 77 = 0, it follows that sup 0<a; <i \z n (x) + Wi(x)| — ► in probability. 
From this, it can be shown that, for < rj < 1/2, 

(3.12) sup M*)+^(*)Uq 

l/k<x<l X 7 ! 

in probability (see, e.g., [7]). Now 

\Ri(9 ln ,S 2 n(y))z n (x) + Ri(x,y)W 1 (x)\ 

SUp r 

0<x,y<l (iVj/f 
x\Jy>l/k 
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. jj / a Q I W \z n {x)+Wi(x)\ 

< SUp Rl{6ln,S2n{y)) ■ SUp 

0<x,y<l l/k<x<l x ' 

\Wi(x) 

+ sup \Ri(x,y) - R 1 (9 ln ,S 2n (y))\ ■ sup 



0<x,y<l 0<a<l x ' 

= :D 31 + D 32 . 

Since R\ is continuous on [0, 2] 2 , it is uniformly continuous and bounded. 
This, together with (3.12), yields D 3 \ — > in probability. The uniform conti- 
nuity of R\, together with (3.11) and the fact that sup 0<a;<1 1 Wi(x)\/x v < oo 
a.s., yields D32 — > in probability and, consequently, D3 — > in probability. 

Finally, we show that 

|-\/fe(?2(z,y)-J(z,y)) + -B(z,y)| . . 

SUp -. r =Op{L). 

o<x, y <i/k (xVy) 7 ? 
Observing that sup 0<Xi j /< i/fc fa(%, y) = a.s., this follows easily. □ 

Proof of Theorem 2.3. For each < (3 < 3, there exist a £ [0, 2) and 
7] G [0, 1/2) such that (3 = a + 2rj. By Theorems 2.1 and 2.2, and Jq(x V 
y)~ a dxdy < 00, it follows that, as n — > 00, 

J Jo<x, y <i (x\Jy)P 

= op(l) / / 7 — — r dxdy 
J Jo<x,y<i {x\Jy) a 

+ ff (A(x,y)+B(x,y)) 2 
J Jo<x, y <i (xyyY 

S [[ ^ y ) + B ^ 2 dxdy. 

JJo<x,y<i (xVy)l 3 □ 



4. Approximating the limit. For testing purposes, we have to find the 
probability distribution of the limiting random variable in Theorem 2.3. This 
can be done by simulating the processes A and B, but unfortunately their 
distributions depend on the unknown measure A. Therefore, we generate 
approximations A n and B n , respectively, of the processes A and B, not 
with parameter A, but with approximated parameter A n . In this section 
we consider the convergence of the sequence of these approximated limiting 
random variables. Until further notice, we take {A n } n >i to be a sequence of 
deterministic measures. 
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R ln {x,y):=\k l / b K l {[x-k- l/ \x + k- 1 ">] x [0,y)), 
#2„(*,y):=^ 1/5 An([0,x) x [y-k-^^y + k- 1 / 5 ]), 
W Rn (x,y):=W An ([0,x)x[0,y)), 

W ln (x):=W An ([0,x] x [0,oo]), 

W 2n (y):=W An ([0,oo] x [0,y]), 

and the process B n by 

B n (x,y) := W Rn (x,y) - Ri n (x,y)W ln (x) - R 2n (x, y)W 2n {y). 

Based on the definition of Z in (2.1) and the homogeneity property of A [i.e. 
X(tx,ty) = j\(x,y)], we define the approximating process Z n by 



(4.1) Z n (6) = { 



A n ,(l,tan#)tan# 
- A re (l,tan0) 
-W 2n (l) 

A n (l/tan0,l) 



Vtanfl Wi n (x) 



dx 



1 W 2n (x) 



dx 



o x 

X n (x,l)dx, 0e[O,7r/4], 



1 Wi n (x) 



dx 



o x 



A n (l/tan6>, 1) 



1 



tan$ 



tan 9 



W 2n {x) 



dx 

tan# 



X n (l,y)dy, 



-W 2n (l)J X n (x,l)dx-W ln (l) 

0G(vr/4,7r/2), 

rOO roc 

-W 2n (l) X n (x,l)dx-W ln {l) X n (l,y)dy, 



9 = tt/2, 

where X n is the approximation of A defined by 

A n (l, y) : = ifc 1/3 A n ([l - AT 1 / 6 , 1 + AT 1 / 6 ] x[y- AT 1 / 6 , y + AT 1 / 6 ]), 



y>o, 



X n (x, 1) : = \k l ^K{[x - k~ x l\x + AT 1 / 6 ] x [1 - AT 1 / 6 , 1 + AT 1 / 6 ]), 



x>0. 
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Finally, define the process A n by 



arctany/x 1 



+ y 



A n (x,y) := < 



tt/4 



•it/A 



sm 

7T 

2 
1 



-Xw An (c e ) + z n (e))de, if y >x, 



:(W A „(C fl ) + Z n (0))d0, if y<x. 



/arctanj//x COS 2 $ 

First we consider the weak convergence of the weighted approximating 
processes. We write Z?2 := D[0, l] 2 for the generalization of D[0, 1] to dimen- 
sion 2, and Cd for the Borel <r-algebra on [D^^d), where d is the metric on 
Z?2 defined in [11]. 

Proposition 4.1. Let A be as in Theorem 2.3. Suppose that {A n } n >! 
is a sequence of measures on [0,oo] 2 \ {(oo,oo)} satisfying that, for each 
x,y> 0, 

(4.2) A n ([0,x] x [0,oo]) = [kx]/k, A n ([0,oo] x [0,y]) = [ky]/k, 



(4.3) 



sup \A n ([0,x] x [0,y])-A([0,x] x [0,y])\^0 

0<x,y<l 



as n — > oo. Further, suppose that 
(4.4) sup \X n (x,l) - A(x,l) 

0<x<l 



0, sup |A n (l,y)-A(l,y)H0, 

0<y<l 



(4.5) 



sup \Rj n (x,y)-Rj(x,y)\-*0, i = 1,2, 

0<x,j/<1 



as n — > oo. T/ien /or eac/i < ry < 1/2, 



(xVy) 7 ) 
weakly in D2 ■ 



,{x,y)e[0,lf 



A(x,y) + B(x,y) 
(x V y)^ 



(x,y)€ [0,1] 2 |, 



Before proving this proposition, we present three corollaries. The last one 
is the main result of this section. 

Corollary 4.1. Under the conditions of Proposition 4.1, for each 
0</3<3, 

(A n (x,y) + B(x,y)f dxdy 
io<x, y <i {xVy) p 

{A(x,y)+B{x,y)) 2 



(4.6) 



0<x,y<l 



{xVyf 



■ dx dy 
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as n — > oo. 



Let Qa„ be the quantile function of the random variable on the left-hand 
side of (4.6) and Q\ the quantile function of the random variable on the 
right-hand side of (4.6). 

Corollary 4.2. Under the conditions of Proposition 4.1, for each 
< f3 < 3 and for each continuity point 1 — a (0 < a < 1) of Q\, 

lim Qa„(1 -a)= Q A (1 - a). 

n — >oo 

Next, with abuse of notation, we estimate A n from the data, so it becomes 
random. In [8], A n is defined as 

i n ( \ n 1 n ^ 



A n (A) := rJ2 T kA/n{ -X) / (-«»A]( ?7 i)'~£ / (-<».^](^) 
i=l \ j=l j=l / 

1 

= t Y,hA(n + 1-R?,n + 1- Rf), 



k . 

i=i 



where Ui := 1 - Fi(Xi), V t := 1 - F 2 (Fj), for i = 1,2, . . . ,n. Note that, for 
x,y> 0, 



1 n 



A n ([0,x) x [0,y)) = ^^ / {^<Qm(fca ; /n),y i <Q2n(fcs//n 



)}■ 



So A„([0,x) x [0,oo]) = {\kx\ - l)/k < [kx]/k = A n ([0,x] x [0,oo]) a.s. and 
A n ([0,cx)] x [0,y)) = (rfel/l-l)/*!<[A;y]/Ar = A n ([0,oo] x [0,y]) a.s. 

The final and main corollary deals with the random measures A n , where 
the functions derived from A n , like A n , are defined as before. In particular, 
we define Q\ n as the quantile function of the random variable on the left- 
hand side of (4.6), conditional on A n , so it is also random. 

Corollary 4.3. Let A n be as in (4.7). Under the conditions of The- 
orem 2.3, we have, for each < [5 < 3 and each continuity point 1 — a 
(0<a<l) of Qa, that 

P 

<2A n (l - ®) — > Qa(1 - a) as co. 

For testing purposes, Corollary 4.3 shows that simulation of the limiting 
random variable in Theorem 2.3 with A replaced by the estimated A n is 
asymptotically correct. 

Now we turn to the proofs. In order to prove Proposition 4.1, by Pro- 
horov's theorem, it is necessary and sufficient to prove the following: 
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(i) The finite-dimensional distributions of {(A n (x,y) + B n (x,y))/(x V 
y) v , (x,y) £ [0, 1] 2 }„>i converge to those of {(A(x,y)+B(x,y))/(xVy) T >, (x,y) £ 

[o,i] 2 }; 

(ii) {(A n (x,y) + B n (x,y))/(x V y) 11 , (x,y) G [0, 1] 2 }„>i is relatively com- 
pact. 

For the relative compactness, we need several lemmas. These lemmas 
and their proofs can be found in a separate Appendix, posted at 
http : //center .uvt . nl/staf f /einmahl/AppEdHL .pdf , or in [10], pages 81- 
87. These lemmas lead to the following results: Under the conditions of 
Proposition 4.1, for each < rj < 1/2, 

^^,(x,y)G[0,l] 2 l 

is relatively compact, and for each < 77 < 1, 

A n (x,y) , x r„ -,,2 

7 — , (x,y) e 0,1 



n>l 



is relatively compact. 



Proof of Proposition 4.1. By these results, 

(a o\ f A n (x,y) + B n (x,y) 2 
(4-8) ( ,(x,y)e[0,l] 



n>l 



is relatively compact. It is easy to check that the finite-dimensional distri- 
butions of our estimated processes in (4.8) converge to those of the limiting 
process, which completes the proof. □ 

Proof of Corollary 4.1. After applying a Skorohod construction to 
the weak convergence statement of Proposition 4.1, the proof is similar to 
that of Theorem 2.3. □ 



Proof of Corollary 4.2. Proposition 4.1 implies the weak conver- 
gence of the distribution function of the left-hand side of (4.6) to the distri- 
bution function of the right-hand side of (4.6). This property carries over to 
the inverse functions Q\ n and Q\. □ 

Proof of Corollary 4.3. From another Skorohod construction, we 
obtain an a.s. version of the statement of Theorem 2.2; without changing 
the notation, we now work with this construction. Since, for < x,y < 1, 



A([0,x] x [0,y])=x + y-l(x,y), 
A n ([0,x] x [0,i/]) = \kx\/k+ \ky]/k-l 2 (x,y) -S n (x,y)/k 
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[S n (x,y) takes values in {0, 1,2}], it follows that, for each e > 0, 

(4.9) sup k l ^ £ \A n ([0,x] x [0,y])-A([0,ac] x [0,y])hfl a.s., 

0<x,y<l 

as n — ► oo. 

We now show that (4.2), (4.3), (4.4) and (4.5) hold a.s. We already saw, 
below (4.7), that (4.2) holds a.s. and the a.s. version of (4.3) follows imme- 
diately from (4.9). 

By (4.9) and (4.2), it easily follows that 

(4.10) sup k l / 2 ~ £ \k n (E) - k{E)\ a.s., 

as n — ► oo, where £ := {E\E = [xi,X2] x [s/i , 2/2] > < x\ < xi < 2,0 < y\ < 
y 2 < 2}. Let E n (x) = [x - k~ 1 / 6 ,x + A; -1 / 6 ] x [1 - k~ 1 / 6 ,l + fc" 1 / 6 ]. Then, 
setting \(u, v) = if u < 0, 

sup |A n (x,l) - A(x,l)| 

0<x<l 

< sup Ik^KiEnix)) - A(E n (x))\ + sup ||A: 1/3 A(S n (x))-A(x,l)| 

0<x<l 0<x<l 

< sup Ik^KiEnix)) - A(E n (x))\ 

0<x<l 

+ sup sup |A(ii, f ) — A(x, 1)| — > a.s., 

0<a;<l ( W ,u)e-B n (a;) 

as re — > 00, by (4.10) and the uniform continuity of A on [—1,2] x [5, 2] [which 
follows from A(0, 1) = 0]. The proofs of sup 0<y<1 |A n (l,y) — A(l, y)\ — > a.s. 
and sup 0<x y<1 \Rj n (x, y) — Rj(x,y)\ — > 0, j = 1,2, a.s. are similar. Hence, 
(4.4) and (4.5) hold a.s. 

According to Corollary 4.2 we have Qa„(1 — a) — > Qa(1 — a) a.s., as n — > 
00, hence, also in probability. □ 

5. Simulation study and real data application. In this section we present 
a simulation study, making use of the results of Section 4. We will consider 
two distributions satisfying the domain of attraction condition and one that 
fails to satisfy it. At the end of the section we will apply our procedure to 
financial data. 

Theoretically, we can choose any f3 S [0,3) in the test statistic in (1.12). 
We investigate the influence of (3 on the testing procedure by sampling from 
the bivariate Cauchy distribution. We choose f3 to be 0, 1 or 2. 

Consider the bivariate Cauchy distribution restricted to the first quadrant, 
with density 

2 

7r(l + x z + y z ) A l z 
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Table 1 

Quantiles of the limiting r.v. for the bivariate Cauchy distribution 



P 





0.10 


0.25 


0.50 


0.75 


0.90 


0.95 


0.975 


0.99 





0.018 


0.025 


0.038 


0.065 


0.106 


0.142 


0.177 


0.227 


1 


0.030 


0.041 


0.062 


0.103 


0.168 


0.222 


0.278 


0.356 


2 


0.074 


0.099 


0.144 


0.224 


0.347 


0.447 


0.554 


0.699 



It readily follows that 



This distribution satisfies the conditions of Theorem 2.3; in particular, 
(2.5) holds with a = 2 (see [8], pages 1409-1410). First we present in Table 1 
the quantiles of the limiting random variable 



We used 1,000,000 replications. With high probability, these quantiles are 
accurate up to 0.01. 

Now for sample size n = 2000, we simulated 2000 times the test statis- 
tic kL n , for various values of k. Using the 0.95th quantiles above, we find 
the simulated type-I error probabilities; see Table 2. In this table also the 
empirical median and the empirical 0.95th quantile of the test statistics are 
shown. In the ideal situation the number of rejections is a binomial r.v. with 
parameters 2000 and 0.05. So the simulated type-I errors in Table 2 are re- 
markably close to 0.05. Only for k = 400 does bias seem to set in. Also, the 
empirical median and 0.95th quantile of the test statistics are very close to 
those of the limiting r.v. listed in Table 1. Generally speaking, the influence 
of (3 on the quality of the results is very small for the Cauchy distribution. 
From Table 2, we feel that (3 = 2 works slightly better than the others. Be- 
cause of this and because we want to put additional emphasis on the extreme 
observations, from now on we take (3 = 2. 

In practice, for a given dataset, we first calculate the test statistic kL n ; 
then we estimate the measure A n and simulate the 0.95th quantile of the 
estimated limiting r.v. using the approximation of Section 4. Finally, if the 
test statistic is not smaller than this 0.95th quantile, we reject the null 
hypothesis (1.8). 

First we consider again the bivariate Cauchy distribution and take two 
samples of size n = 2000. The results are presented in Figure 1. Note that 




X(x,y) 



xy 



x,y>0. 



(x 2 + y 2 ) 3 / 2 
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Fig. 1. C'auchy distribution: test statistic and 0.95th quantile for two samples. 



the behavior of the test statistic and the estimated 0.95th quantile fluctuate 
with the sample fraction k, but that, for all k in the figure, the value is — 
correctly — far below the estimated 0.95th quantile of the limiting random 
variable. 

Next we generate 2000 independent pairs (U, 1 — V), where (U, V) has a 
Gumbel copula as distribution function, that is, the d.f. is given by 

C(u, v) = exp(-[(- logu) e + (- logv) > 1; 

we take 9 = 10. It is easily checked that, for the d.f. of (U, 1 — V), (1.1) holds 
and that we have asymptotic independence; see Remark 2.2. Since our results 
do not apply for the case of asymptotic independence, we only present the 
test statistic itself (Figure 2, left panel). We see that, for k up to 200, the test 
statistic is very close to (which strongly supports Hq) and that bias sets in 

Table 2 

Simulated type-I error, median and 0.95th quantile of the test statistics for the Cauchy 

d.f.; n = 2000, a = 0.05 



(3 k 20 40 60 80 100 125 150 175 200 300 350 400 

a 0.041 0.045 0.047 0.044 0.038 0.047 0.047 0.034 0.035 0.049 0.048 0.060 

Q(0.95) 0.134 0.135 0.139 0.132 0.129 0.139 0.139 0.127 0.125 0.141 0.140 0.153 
Q(0.50) 0.036 0.038 0.036 0.036 0.036 0.036 0.036 0.036 0.036 0.040 0.040 0.047 

a 0.041 0.047 0.047 0.045 0.039 0.050 0.044 0.036 0.034 0.054 0.046 0.061 

1 Q(0.95) 0.208 0.213 0.216 0.210 0.210 0.220 0.216 0.203 0.203 0.226 0.216 0.236 
Q(0.50) 0.059 0.061 0.059 0.059 0.059 0.060 0.059 0.058 0.058 0.065 0.064 0.076 

a 0.047 0.042 0.049 0.048 0.044 0.047 0.044 0.044 0.042 0.053 0.050 0.068 

2 Q(0.95) 0.434 0.423 0.444 0.442 0.430 0.437 0.431 0.431 0.416 0.463 0.446 0.503 
Q(0.50) 0.133 0.138 0.137 0.135 0.137 0.141 0.138 0.143 0.143 0.156 0.156 0.195 
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Fig. 2. Test statistic transformed Gumbel copula (left) and alternative distribution 
(right). 



dally returns 1991-2003 



9 c 


O O ° g 









Fig. 3. Daily equity returns of two Dutch banks (left) and test statistic and 0.95th quan- 
tile ( right ). 



for larger values of k. We also consider 2000 observations from a d.f. (which 
is also a copula), which does not satisfy condition (1.8). The distribution is 
an adaptation of a distribution in [15]: take a density of 3/2 on the following 
rectangles: [2~( 2m+1 ),2-( 2m )] x [2"( 2r+1 ), 2"( 2r )], for m = 0,l,2,... and r = 
0, 1, 2, . . .; in this way a probability mass of 2/3 is assigned. The remaining 
1 /3 is assigned by taking the uniform distribution on the line segments from 
(• 2 -(2m+2) )2 -(2m+2)) tQ ^-( 2 m+l) ^-(2m+l)^ m = 0,1,2,..., Such that the 

mass of the mth segment is equal to 2~( 2m+2 ). In Figure 2 (right panel), 
we see, for varying k, the test statistic and simulated 0.95th quantile of the 
sample of size n = 2000 from this distribution. Again, the test statistic and 
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the estimated 0.95th quantile fluctuate with k, but from a certain k on (and 
for most values of k), the null hypothesis is clearly rejected. 

Finally, we apply the test to real data, similarly as we just did for the 
simulated data sets in Figures 1 and 2. The data are 3283 daily logarithmic 
equity returns over the period 1991-2003 for two Dutch banks, ING and 
ABN AMRO bank. The bivariate, heavy-tailed data are shown in Figure 3 on 
the left; on the right we see again the test statistic and the simulated 0.95th 
quantile. Since the test statistic is everywhere clearly below the quantile, 
we cannot reject the null hypothesis. This is a satisfactory result, because 
it allows us to analyze these data further, using the statistical theory of 
extremes. 
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